Overview

Dataset statistics

Number of variables: 11
Number of observations: 1000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 133.8 KiB
Average record size in memory: 137.0 B

Variable types

NUM: 10
BOOL: 1

Reproduction

Analysis started: 2020-10-15 05:30:43.541600
Analysis finished: 2020-10-15 05:31:25.538645
Duration: 42 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

WTT has unique values Unique
PTI has unique values Unique
EQW has unique values Unique
SBI has unique values Unique
LQE has unique values Unique
QWG has unique values Unique
FDJ has unique values Unique
PJF has unique values Unique
HQE has unique values Unique
NXJ has unique values Unique

Variables

WTT
Real number (ℝ≥0)

UNIQUE

Distinct count: 1000
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.9496815136132963
Minimum: 0.17441166839163802
Maximum: 1.721779168965468
Zeros: 0
Zeros (%): 0.0%
Memory size: 55.6 KiB

Quantile statistics

Minimum: 0.1744116684
5-th percentile: 0.4808126997
Q1: 0.742357677
median: 0.9404750904
Q3: 1.163294705
95-th percentile: 1.422647519
Maximum: 1.721779169
Range: 1.547367501
Interquartile range (IQR): 0.4209370277

Descriptive statistics

Standard deviation: 0.2896352517
Coefficient of variation (CV): 0.3049814569
Kurtosis: -0.5121417265
Mean: 0.9496815136
Median Absolute Deviation (MAD): 0.2104888157
Skewness: 0.07022360441
Sum: 949.6815136
Variance: 0.083888579
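
Several of the figures reported for WTT can be derived from the others, which gives a quick consistency check on the report. The snippet below is a minimal sketch of that check; the variable names are illustrative and the inputs are copied from the rounded values shown in this section, so agreement is only expected to within rounding.

```python
# Inputs copied from the WTT statistics reported in this section (rounded values).
n = 1000
minimum, maximum = 0.1744116684, 1.721779169
q1, q3 = 0.742357677, 1.163294705
mean, std = 0.9496815136, 0.2896352517

value_range = maximum - minimum  # Range = Maximum - Minimum
iqr = q3 - q1                    # Interquartile range = Q3 - Q1
cv = std / mean                  # Coefficient of variation = std / mean
variance = std ** 2              # Variance = std squared
total = mean * n                 # Sum = mean * number of observations

print(f"Range    {value_range:.9f}")
print(f"IQR      {iqr:.9f}")
print(f"CV       {cv:.9f}")
print(f"Variance {variance:.9f}")
print(f"Sum      {total:.7f}")
```

The same relationships hold for every numeric variable in the report.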
Histogram with fixed size bins (bins=10)
Value               Count  Frequency (%)
1.237548901         1      0.1%
0.721359808         1      0.1%
0.6598803834        1      0.1%
0.8765707232        1      0.1%
1.565291462         1      0.1%
0.8704760252        1      0.1%
0.9533179522        1      0.1%
0.5629320184        1      0.1%
1.109824473         1      0.1%
0.8096267621        1      0.1%
0.6201993107        1      0.1%
1.184571047         1      0.1%
0.6514020525        1      0.1%
0.6492507031        1      0.1%
1.032270108         1      0.1%
1.029603685         1      0.1%
1.510573641         1      0.1%
0.6616516797        1      0.1%
0.8099842688        1      0.1%
1.126374882         1      0.1%
0.6007137099        1      0.1%
0.9240008932        1      0.1%
1.034458756         1      0.1%
0.7305360462        1      0.1%
0.9663830322        1      0.1%
Other values (975)  975    97.5%

Value               Count  Frequency (%)
0.1744116684        1      0.1%
0.1817337413        1      0.1%
0.2404062646        1      0.1%
0.2538020688        1      0.1%
0.2726425829        1      0.1%
0.2891091002        1      0.1%
0.306564497         1      0.1%
0.3205589742        1      0.1%
0.3288795051        1      0.1%
0.3356411092        1      0.1%

Value               Count  Frequency (%)
1.721779169         1      0.1%
1.672816756         1      0.1%
1.66737123          1      0.1%
1.650502508         1      0.1%
1.6429327           1      0.1%
1.626460557         1      0.1%
1.613931185         1      0.1%
1.609884695         1      0.1%
1.606845381         1      0.1%
1.605452048         1      0.1%

PTI
Real number (ℝ≥0)

UNIQUE

Distinct count: 1000
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.1143025412357455
Minimum: 0.44139810029598897
Maximum: 1.8337565522536252
Zeros: 0
Zeros (%): 0.0%
Memory size: 55.6 KiB

Quantile statistics

Minimum: 0.4413981003
5-th percentile: 0.6927410833
Q1: 0.9420706036
median: 1.118486147
Q3: 1.307904307
95-th percentile: 1.521919468
Maximum: 1.833756552
Range: 1.392358452
Interquartile range (IQR): 0.3658337037

Descriptive statistics

Standard deviation: 0.2570852621
Coefficient of variation (CV): 0.2307140589
Kurtosis: -0.4065297047
Mean: 1.114302541
Median Absolute Deviation (MAD): 0.1836738127
Skewness: -0.1012422881
Sum: 1114.302541
Variance: 0.06609283201
Histogram with fixed size bins (bins=10)
Value               Count  Frequency (%)
1.309832993         1      0.1%
0.815129373         1      0.1%
0.8429885232        1      0.1%
1.076784512         1      0.1%
1.078280275         1      0.1%
1.317067024         1      0.1%
0.6873901483        1      0.1%
1.363461048         1      0.1%
1.007931874         1      0.1%
1.320495019         1      0.1%
1.004623634         1      0.1%
1.306032297         1      0.1%
0.6482945696        1      0.1%
1.072431913         1      0.1%
0.9921892138        1      0.1%
0.8372899038        1      0.1%
1.206634085         1      0.1%
1.394303781         1      0.1%
1.440775452         1      0.1%
0.5367687541        1      0.1%
1.278156461         1      0.1%
1.417945702         1      0.1%
1.025657102         1      0.1%
0.9835013684        1      0.1%
1.036356919         1      0.1%
Other values (975)  975    97.5%

Value               Count  Frequency (%)
0.4413981003        1      0.1%
0.4449982481        1      0.1%
0.4574129515        1      0.1%
0.465264244         1      0.1%
0.4653423606        1      0.1%
0.4667327422        1      0.1%
0.4672097517        1      0.1%
0.482050426         1      0.1%
0.5018571338        1      0.1%
0.5151114026        1      0.1%

Value               Count  Frequency (%)
1.833756552         1      0.1%
1.803524108         1      0.1%
1.752338907         1      0.1%
1.744868226         1      0.1%
1.741634027         1      0.1%
1.714327545         1      0.1%
1.70561871          1      0.1%
1.691621649         1      0.1%
1.687262977         1      0.1%
1.66182033          1      0.1%

EQW
Real number (ℝ≥0)

UNIQUE

Distinct count: 1000
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.8341268968602705
Minimum: 0.1709236280526556
Maximum: 1.7227247553711322
Zeros: 0
Zeros (%): 0.0%
Memory size: 55.6 KiB

Quantile statistics

Minimum: 0.1709236281
5-th percentile: 0.3987435258
Q1: 0.6154512591
median: 0.8132641236
Q3: 1.028340048
95-th percentile: 1.34386783
Maximum: 1.722724755
Range: 1.551801127
Interquartile range (IQR): 0.4128887884

Descriptive statistics

Standard deviation: 0.2915538503
Coefficient of variation (CV): 0.3495317696
Kurtosis: -0.4075914236
Mean: 0.8341268969
Median Absolute Deviation (MAD): 0.2071741209
Skewness: 0.2972757418
Sum: 834.1268969
Variance: 0.08500364765
Histogram with fixed size bins (bins=10)
Value               Count  Frequency (%)
0.6617350214        1      0.1%
0.3705031238        1      0.1%
0.8956163131        1      0.1%
0.7796956727        1      0.1%
0.8445055744        1      0.1%
0.9735418936        1      0.1%
1.554618939         1      0.1%
0.5106756502        1      0.1%
0.7772340754        1      0.1%
0.971997695         1      0.1%
0.4889863389        1      0.1%
0.4594290446        1      0.1%
0.3263055393        1      0.1%
1.286188956         1      0.1%
0.930916894         1      0.1%
1.17492312          1      0.1%
0.9325818532        1      0.1%
0.6817464631        1      0.1%
0.7662944812        1      0.1%
0.9729484802        1      0.1%
1.082679124         1      0.1%
1.07534045          1      0.1%
0.5817543816        1      0.1%
1.292879386         1      0.1%
1.029789801         1      0.1%
Other values (975)  975    97.5%

Value               Count  Frequency (%)
0.1709236281        1      0.1%
0.1943657396        1      0.1%
0.2283328791        1      0.1%
0.2304810023        1      0.1%
0.2315243089        1      0.1%
0.2341003131        1      0.1%
0.2465189981        1      0.1%
0.2668469711        1      0.1%
0.2685413452        1      0.1%
0.2758557345        1      0.1%

Value               Count  Frequency (%)
1.722724755         1      0.1%
1.676862259         1      0.1%
1.66838248          1      0.1%
1.608669784         1      0.1%
1.587804167         1      0.1%
1.575248407         1      0.1%
1.574275919         1      0.1%
1.572074523         1      0.1%
1.554618939         1      0.1%
1.546473722         1      0.1%

SBI
Real number (ℝ≥0)

UNIQUE

Distinct count: 1000
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.6820993715302579
Minimum: 0.04502666640941666
Maximum: 1.6348840454364368
Zeros: 0
Zeros (%): 0.0%
Memory size: 55.6 KiB

Quantile statistics

Minimum: 0.04502666641
5-th percentile: 0.3233374095
Q1: 0.5150097868
median: 0.6768346403
Q3: 0.834316777
95-th percentile: 1.059452011
Maximum: 1.634884045
Range: 1.589857379
Interquartile range (IQR): 0.3193069902

Descriptive statistics

Standard deviation: 0.2296450242
Coefficient of variation (CV): 0.3366738539
Kurtosis: 0.1783451761
Mean: 0.6820993715
Median Absolute Deviation (MAD): 0.1595191286
Skewness: 0.274569445
Sum: 682.0993715
Variance: 0.05273683712
Histogram with fixed size bins (bins=10)
Value               Count  Frequency (%)
0.6573441956        1      0.1%
0.6939279275        1      0.1%
0.4238117366        1      0.1%
0.7368332098        1      0.1%
1.193523447         1      0.1%
0.414485594         1      0.1%
0.5952867143        1      0.1%
0.8143574266        1      0.1%
1.043302617         1      0.1%
0.5743966323        1      0.1%
0.4652851033        1      0.1%
0.4735940354        1      0.1%
0.5723581856        1      0.1%
0.2551123684        1      0.1%
0.5007517189        1      0.1%
0.6145357907        1      0.1%
0.370077892         1      0.1%
0.8368023949        1      0.1%
0.5261610521        1      0.1%
0.7481891888        1      0.1%
0.6757372511        1      0.1%
0.6317713577        1      0.1%
0.3473857326        1      0.1%
0.3504959314        1      0.1%
0.6575357329        1      0.1%
Other values (975)  975    97.5%

Value               Count  Frequency (%)
0.04502666641       1      0.1%
0.1089857234        1      0.1%
0.1557459146        1      0.1%
0.1680355595        1      0.1%
0.1721797687        1      0.1%
0.1789914308        1      0.1%
0.1882778595        1      0.1%
0.1898359192        1      0.1%
0.1995008585        1      0.1%
0.2064183495        1      0.1%

Value               Count  Frequency (%)
1.634884045         1      0.1%
1.52991655          1      0.1%
1.478041375         1      0.1%
1.475274224         1      0.1%
1.409810923         1      0.1%
1.402532945         1      0.1%
1.299420834         1      0.1%
1.279163769         1      0.1%
1.254622016         1      0.1%
1.242004404         1      0.1%

LQE
Real number (ℝ≥0)

UNIQUE

Distinct count: 1000
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.0323363284327982
Minimum: 0.31530700779609955
Maximum: 1.650049589008639
Zeros: 0
Zeros (%): 0.0%
Memory size: 55.6 KiB

Quantile statistics

Minimum: 0.3153070078
5-th percentile: 0.627152476
Q1: 0.8708551591
median: 1.035824471
Q3: 1.198270028
95-th percentile: 1.437827032
Maximum: 1.650049589
Range: 1.334742581
Interquartile range (IQR): 0.3274148684

Descriptive statistics

Standard deviation: 0.2434129535
Coefficient of variation (CV): 0.2357884216
Kurtosis: -0.2495605744
Mean: 1.032336328
Median Absolute Deviation (MAD): 0.1643464036
Skewness: -0.02598465764
Sum: 1032.336328
Variance: 0.05924986592
Histogram with fixed size bins (bins=10)
Value               Count  Frequency (%)
0.9349542736        1      0.1%
0.9025977575        1      0.1%
0.6225023575        1      0.1%
1.183360482         1      0.1%
1.207635255         1      0.1%
1.232726267         1      0.1%
1.082514669         1      0.1%
1.347700051         1      0.1%
1.086183108         1      0.1%
0.4301051775        1      0.1%
0.9706869824        1      0.1%
1.26565337          1      0.1%
1.133081617         1      0.1%
0.3153070078        1      0.1%
0.5411895166        1      0.1%
0.7411288284        1      0.1%
0.8748746615        1      0.1%
1.549126128         1      0.1%
1.040426409         1      0.1%
0.9456363038        1      0.1%
0.9274220318        1      0.1%
1.55201506          1      0.1%
0.8273418369        1      0.1%
1.101368058         1      0.1%
1.307825249         1      0.1%
Other values (975)  975    97.5%

Value               Count  Frequency (%)
0.3153070078        1      0.1%
0.3584849745        1      0.1%
0.4088893895        1      0.1%
0.4301051775        1      0.1%
0.4346045631        1      0.1%
0.4409280045        1      0.1%
0.4586505245        1      0.1%
0.4650431721        1      0.1%
0.4751235028        1      0.1%
0.4764174176        1      0.1%

Value               Count  Frequency (%)
1.650049589         1      0.1%
1.648875092         1      0.1%
1.641202432         1      0.1%
1.634041984         1      0.1%
1.63169299          1      0.1%
1.614740565         1      0.1%
1.612146705         1      0.1%
1.609249562         1      0.1%
1.608789835         1      0.1%
1.608612004         1      0.1%

QWG
Real number (ℝ≥0)

UNIQUE

Distinct count: 1000
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.9435343420010476
Minimum: 0.2623888468883443
Maximum: 1.6669023520657231
Zeros: 0
Zeros (%): 0.0%
Memory size: 55.6 KiB

Quantile statistics

Minimum: 0.2623888469
5-th percentile: 0.5416434764
Q1: 0.7610635317
median: 0.9415016708
Q3: 1.123060095
95-th percentile: 1.380566821
Maximum: 1.666902352
Range: 1.404513505
Interquartile range (IQR): 0.3619965632

Descriptive statistics

Standard deviation: 0.2561205966
Coefficient of variation (CV): 0.2714480917
Kurtosis: -0.4060584667
Mean: 0.943534342
Median Absolute Deviation (MAD): 0.1813122866
Skewness: 0.06334905949
Sum: 943.534342
Variance: 0.06559776001
Histogram with fixed size bins (bins=10)
Value               Count  Frequency (%)
0.835704771         1      0.1%
0.7249079825        1      0.1%
0.7668575434        1      0.1%
1.18208686          1      0.1%
0.81239813          1      0.1%
0.802208501         1      0.1%
0.9060735331        1      0.1%
0.7278836212        1      0.1%
0.784033671         1      0.1%
1.394549138         1      0.1%
0.8036095443        1      0.1%
1.20304099          1      0.1%
0.9018730611        1      0.1%
1.313512749         1      0.1%
1.224083987         1      0.1%
1.025068236         1      0.1%
1.300408882         1      0.1%
0.7316468296        1      0.1%
1.241789379         1      0.1%
0.6343253471        1      0.1%
1.110973194         1      0.1%
1.000187463         1      0.1%
0.9710783903        1      0.1%
0.7376386373        1      0.1%
0.9975270726        1      0.1%
Other values (975)  975    97.5%

Value               Count  Frequency (%)
0.2623888469        1      0.1%
0.3080153017        1      0.1%
0.3092977209        1      0.1%
0.3498897349        1      0.1%
0.3526077229        1      0.1%
0.3554714335        1      0.1%
0.3582541111        1      0.1%
0.3600381752        1      0.1%
0.3618962529        1      0.1%
0.3819627871        1      0.1%

Value               Count  Frequency (%)
1.666902352         1      0.1%
1.638659452         1      0.1%
1.617597444         1      0.1%
1.614625772         1      0.1%
1.58649738          1      0.1%
1.545570774         1      0.1%
1.534618709         1      0.1%
1.526139323         1      0.1%
1.522614977         1      0.1%
1.513319615         1      0.1%

FDJ
Real number (ℝ≥0)

UNIQUE

Distinct count: 1000
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.9634218685367859
Minimum: 0.2952280855806717
Maximum: 1.7133422293242386
Zeros: 0
Zeros (%): 0.0%
Memory size: 55.6 KiB

Quantile statistics

Minimum: 0.2952280856
5-th percentile: 0.5590828785
Q1: 0.7844066602
median: 0.9453330074
Q3: 1.134851932
95-th percentile: 1.403975101
Maximum: 1.713342229
Range: 1.418114144
Interquartile range (IQR): 0.3504452715

Descriptive statistics

Standard deviation: 0.2551180291
Coefficient of variation (CV): 0.2648040671
Kurtosis: -0.1679881805
Mean: 0.9634218685
Median Absolute Deviation (MAD): 0.1715011222
Skewness: 0.2071411911
Sum: 963.4218685
Variance: 0.06508520879
Histogram with fixed size bins (bins=10)
Value               Count  Frequency (%)
1.024119672         1      0.1%
1.463484282         1      0.1%
0.9297371276        1      0.1%
1.471325985         1      0.1%
1.138983334         1      0.1%
1.194269027         1      0.1%
1.224055492         1      0.1%
1.140667208         1      0.1%
0.7959573831        1      0.1%
1.062580348         1      0.1%
1.060803683         1      0.1%
0.6996728919        1      0.1%
0.9406902463        1      0.1%
0.5344401856        1      0.1%
0.6443302697        1      0.1%
0.9932975339        1      0.1%
0.8981569175        1      0.1%
0.3745369021        1      0.1%
0.9718490667        1      0.1%
1.151549121         1      0.1%
1.07346327          1      0.1%
0.9420289473        1      0.1%
0.7237358314        1      0.1%
0.9446835935        1      0.1%
0.8810990007        1      0.1%
Other values (975)  975    97.5%

Value               Count  Frequency (%)
0.2952280856        1      0.1%
0.2963172193        1      0.1%
0.3132418139        1      0.1%
0.3375101986        1      0.1%
0.3745369021        1      0.1%
0.3868020568        1      0.1%
0.3927399394        1      0.1%
0.3966606595        1      0.1%
0.4016172013        1      0.1%
0.4027985706        1      0.1%

Value               Count  Frequency (%)
1.713342229         1      0.1%
1.670574191         1      0.1%
1.664493151         1      0.1%
1.658007007         1      0.1%
1.644847684         1      0.1%
1.626350711         1      0.1%
1.621366508         1      0.1%
1.619000487         1      0.1%
1.603258231         1      0.1%
1.59719846          1      0.1%

PJF
Real number (ℝ≥0)

UNIQUE

Distinct count: 1000
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.0719604990030185
Minimum: 0.299475657020008
Maximum: 1.7854196250383634
Zeros: 0
Zeros (%): 0.0%
Memory size: 55.6 KiB

Quantile statistics

Minimum: 0.299475657
5-th percentile: 0.5936087274
Q1: 0.8663056482
median: 1.065500415
Q3: 1.283155729
95-th percentile: 1.541567978
Maximum: 1.785419625
Range: 1.485943968
Interquartile range (IQR): 0.4168500811

Descriptive statistics

Standard deviation: 0.2889816433
Coefficient of variation (CV): 0.269582362
Kurtosis: -0.5158956688
Mean: 1.071960499
Median Absolute Deviation (MAD): 0.2108338788
Skewness: -0.02348483043
Sum: 1071.960499
Variance: 0.08351039015
Histogram with fixed size bins (bins=10)
Value               Count  Frequency (%)
1.554229852         1      0.1%
1.306448942         1      0.1%
0.8934629482        1      0.1%
1.409437439         1      0.1%
0.6329060706        1      0.1%
1.207398166         1      0.1%
0.8657732592        1      0.1%
1.400316728         1      0.1%
1.642697225         1      0.1%
0.5921199553        1      0.1%
1.180761198         1      0.1%
1.103794389         1      0.1%
1.03602402          1      0.1%
0.6071618616        1      0.1%
1.195513229         1      0.1%
1.191695624         1      0.1%
1.170992002         1      0.1%
0.9360128129        1      0.1%
1.554005134         1      0.1%
1.44783698          1      0.1%
1.245160395         1      0.1%
0.6560412621        1      0.1%
0.568448152         1      0.1%
1.375221666         1      0.1%
0.7267229146        1      0.1%
Other values (975)  975    97.5%

Value               Count  Frequency (%)
0.299475657         1      0.1%
0.3197523383        1      0.1%
0.3698407711        1      0.1%
0.3738711871        1      0.1%
0.3834068138        1      0.1%
0.3895844195        1      0.1%
0.4093993839        1      0.1%
0.4131360351        1      0.1%
0.4232938122        1      0.1%
0.426017464         1      0.1%

Value               Count  Frequency (%)
1.785419625         1      0.1%
1.7658416           1      0.1%
1.764700626         1      0.1%
1.74191894          1      0.1%
1.732826292         1      0.1%
1.726211771         1      0.1%
1.71242973          1      0.1%
1.70921443          1      0.1%
1.705175538         1      0.1%
1.69918683          1      0.1%

HQE
Real number (ℝ≥0)

UNIQUE

Distinct count: 1000
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.158250790498556
Minimum: 0.36515660986139775
Maximum: 1.8856900849797629
Zeros: 0
Zeros (%): 0.0%
Memory size: 55.6 KiB

Quantile statistics

Minimum: 0.3651566099
5-th percentile: 0.6591256707
Q1: 0.9343401315
median: 1.165556198
Q3: 1.383173101
95-th percentile: 1.614934953
Maximum: 1.885690085
Range: 1.520533475
Interquartile range (IQR): 0.44883297

Descriptive statistics

Standard deviation: 0.2937375166
Coefficient of variation (CV): 0.2536044171
Kurtosis: -0.6934295182
Mean: 1.15825079
Median Absolute Deviation (MAD): 0.2244629606
Skewness: -0.1246263155
Sum: 1158.25079
Variance: 0.08628172867
Histogram with fixed size bins (bins=10)
Value               Count  Frequency (%)
0.9970821537        1      0.1%
0.9723648208        1      0.1%
0.8066334685        1      0.1%
0.7816117987        1      0.1%
0.8763367838        1      0.1%
0.9698115209        1      0.1%
1.459543799         1      0.1%
0.8483570672        1      0.1%
1.588674975         1      0.1%
1.324721942         1      0.1%
1.50769143          1      0.1%
0.842951308         1      0.1%
1.467772752         1      0.1%
1.36820852          1      0.1%
0.8794220914        1      0.1%
0.9377355958        1      0.1%
1.031367019         1      0.1%
1.022498731         1      0.1%
1.720439947         1      0.1%
1.224295341         1      0.1%
1.47296247          1      0.1%
1.05810374          1      0.1%
1.311664336         1      0.1%
1.232285444         1      0.1%
1.590123883         1      0.1%
Other values (975)  975    97.5%

Value               Count  Frequency (%)
0.3651566099        1      0.1%
0.3862998181        1      0.1%
0.3998860652        1      0.1%
0.4069163934        1      0.1%
0.4213115046        1      0.1%
0.5066174112        1      0.1%
0.5103502607        1      0.1%
0.5156268934        1      0.1%
0.515960949         1      0.1%
0.5228766445        1      0.1%

Value               Count  Frequency (%)
1.885690085         1      0.1%
1.827586099         1      0.1%
1.819304674         1      0.1%
1.804361654         1      0.1%
1.785440077         1      0.1%
1.768134227         1      0.1%
1.762923675         1      0.1%
1.761104916         1      0.1%
1.749759729         1      0.1%
1.749291172         1      0.1%

NXJ
Real number (ℝ≥0)

UNIQUE

Distinct count: 1000
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.3627245977228886
Minimum: 0.639692747423801
Maximum: 1.8939496030653464
Zeros: 0
Zeros (%): 0.0%
Memory size: 55.6 KiB

Quantile statistics

Minimum: 0.6396927474
5-th percentile: 1.009314364
Q1: 1.222622614
median: 1.37536799
Q3: 1.504831903
95-th percentile: 1.677427964
Maximum: 1.893949603
Range: 1.254256856
Interquartile range (IQR): 0.2822092897

Descriptive statistics

Standard deviation: 0.2042250234
Coefficient of variation (CV): 0.1498652213
Kurtosis: 0.0121978312
Mean: 1.362724598
Median Absolute Deviation (MAD): 0.1426250013
Skewness: -0.2380995959
Sum: 1362.724598
Variance: 0.04170786019
Histogram with fixed size bins (bins=10)
Value               Count  Frequency (%)
1.30684142          1      0.1%
1.43029756          1      0.1%
1.477542262         1      0.1%
1.550834517         1      0.1%
1.602490641         1      0.1%
1.445797411         1      0.1%
1.375496574         1      0.1%
1.669874564         1      0.1%
1.410941303         1      0.1%
1.45987161          1      0.1%
1.449339574         1      0.1%
1.40352662          1      0.1%
1.173067345         1      0.1%
1.494912737         1      0.1%
1.109849796         1      0.1%
1.486151978         1      0.1%
1.445770584         1      0.1%
1.233871493         1      0.1%
1.555707787         1      0.1%
1.312478334         1      0.1%
1.418127542         1      0.1%
1.525480646         1      0.1%
0.9842241296        1      0.1%
0.9631669994        1      0.1%
1.361047903         1      0.1%
Other values (975)  975    97.5%

Value               Count  Frequency (%)
0.6396927474        1      0.1%
0.6414993489        1      0.1%
0.7033250537        1      0.1%
0.7258232978        1      0.1%
0.741047737         1      0.1%
0.7726366324        1      0.1%
0.7853303416        1      0.1%
0.8329479411        1      0.1%
0.8804084513        1      0.1%
0.8831110215        1      0.1%

Value               Count  Frequency (%)
1.893949603         1      0.1%
1.893014317         1      0.1%
1.890761792         1      0.1%
1.888675226         1      0.1%
1.865059518         1      0.1%
1.853902785         1      0.1%
1.851324002         1      0.1%
1.830225923         1      0.1%
1.829284956         1      0.1%
1.813143603         1      0.1%

TARGET CLASS
Boolean

Distinct count: 2
Unique (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 55.6 KiB

Value  Count  Frequency (%)
1      500    50.0%
0      500    50.0%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a perfectly linear relationship the steepness of the line (its angle to the x-axis) does not affect the magnitude of r, provided the slope is nonzero.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
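
The calculation described above can be sketched in a few lines of plain Python. This is an illustrative implementation, not the code the report generator uses; the function name `pearson_r` and the sample data are assumptions made for the example.

```python
import math


def pearson_r(x, y):
    """Pearson's r: covariance of x and y over the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n factors of the covariance and the standard deviations cancel,
    # so plain sums of products and squares are sufficient.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / math.sqrt(ss_x * ss_y)


x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 8.1, 9.8]
print(pearson_r(x, y))
# Shifting and rescaling x leaves r unchanged (location and scale invariance).
print(pearson_r([10 * a + 3 for a in x], y))
```

The second call illustrates the invariance property: a positive rescaling and a shift of one variable do not change r.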

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
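
As a rough sketch of this definition, ρ can be computed by ranking both variables and applying the Pearson formula to the ranks. The helper names below (`average_ranks`, `spearman_rho`) are hypothetical, and assigning tied values their average rank is an assumption of this example rather than something the text specifies.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho: the Pearson correlation of the rank variables."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)


# A monotonic but nonlinear relationship (y = x cubed) still gives rho of 1.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y))
```

Because only the ordering of the values matters, any strictly increasing transformation of either variable leaves ρ unchanged.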

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
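
The pair-counting definition above corresponds to the variant usually called tau-a, and can be sketched directly. The function name `kendall_tau_a` is an assumption for this example; library implementations often default to the tau-b variant, which adjusts the denominator for ties, so results can differ when the data contain tied values.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) / total number of pairs."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables order the pair the same way
        elif sign < 0:
            discordant += 1   # the variables order the pair in opposite ways
        # sign == 0 means a tie in x or y; the pair counts as neither
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Three pairs: two concordant, one discordant, so tau = (2 - 1) / 3.
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```

The nested pair loop is quadratic in the number of observations, which is fine for illustration but slower than the sorting-based methods typically used in practice.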

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient for a bivariate normal input distribution. Extensive documentation is available in the Phik project documentation.

Missing values

Sample

First rows

     WTT       PTI       EQW       SBI       LQE       QWG       FDJ       PJF       HQE       NXJ       TARGET CLASS
0    0.913917  1.162073  0.567946  0.755464  0.780862  0.352608  0.759697  0.643798  0.879422  1.231409  1
1    0.635632  1.003722  0.535342  0.825645  0.924109  0.648450  0.675334  1.013546  0.621552  1.492702  0
2    0.721360  1.201493  0.921990  0.855595  1.526629  0.720781  1.626351  1.154483  0.957877  1.285597  0
3    1.234204  1.386726  0.653046  0.825624  1.142504  0.875128  1.409708  1.380003  1.522692  1.153093  1
4    1.279491  0.949750  0.627280  0.668976  1.232537  0.703727  1.115596  0.646691  1.463812  1.419167  1
5    0.833928  1.523302  1.104743  1.021139  1.107377  1.010930  1.279538  1.280677  0.510350  1.528044  0
6    0.944705  1.251761  1.074885  0.286473  0.996440  0.428860  0.910805  0.755305  1.111800  1.110842  0
7    0.816174  1.088392  0.895343  0.243860  0.943123  1.045131  1.146536  1.341886  1.225324  1.425784  0
8    0.776551  1.463812  0.783825  0.337278  0.742215  1.072756  0.880300  1.312951  1.118165  1.225922  0
9    0.772280  0.515111  0.891596  0.940862  1.430568  0.885876  1.205231  0.596858  1.542580  0.981879  1

Last rows

     WTT       PTI       EQW       SBI       LQE       QWG       FDJ       PJF       HQE       NXJ       TARGET CLASS
990  0.876112  0.942414  1.060605  1.478041  0.818773  1.473635  1.306364  1.297386  0.522877  1.286394  0
991  1.102612  1.007163  0.535051  0.633220  0.736791  0.864663  1.080128  1.230731  1.180497  1.677409  1
992  0.809627  1.602700  0.990945  0.649933  1.118883  0.899837  0.919117  1.608892  0.978616  1.275621  0
993  0.733687  1.049636  0.729194  0.851512  1.552015  0.954450  0.469426  0.862135  1.464802  1.088759  1
994  1.212650  0.839062  0.456012  0.773420  1.091210  0.794378  0.736621  1.162377  1.512756  1.415168  1
995  1.010953  1.034006  0.853116  0.622460  1.036610  0.586240  0.746811  0.319752  1.117340  1.348517  1
996  0.575529  0.955786  0.941835  0.792882  1.414277  1.269540  1.055928  0.713193  0.958684  1.663489  0
997  1.135470  0.982462  0.781905  0.916738  0.901031  0.884738  0.386802  0.389584  0.919191  1.385504  1
998  1.084894  0.861769  0.407158  0.665696  1.608612  0.943859  0.855806  1.061338  1.277456  1.188063  1
999  0.837460  0.961184  0.417006  0.799784  0.934399  0.424762  0.778234  0.907962  1.257190  1.364837  1